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ABSTRACT 


The TBM* Memory System is an on-line, high density memory 
system with a capacity of up to three trillion bits. It isa 
cost efficient method of storing large data bases on-line with 
an effective response Similar to disc storage. 

Some features of the TBM* Memory System are: 1) Complete 
system redundance; 2) Modular expansion; 3) Hardware and data 
transfer reliability; 4) Large data base on-line; 5) Rapid 
response; and 6) Low cost per bit. 

Major applications of a mass storage system are: 1) The 
replacement of large conventional on-line and off-line tape 
library systems; 2) Supplements to enlarge disc memory systems; 
and 3) the provision of on-line storage for large data bases. 

Since the TBM* Memory System is indeed very complicated, a 
Simulation model was developed to help the user understand the 
system and to predict the TBM* performance under various con- 
figurations and loading conditions. In order to help the user 
select the proper configuration for his installation, a 


sensitivity analysis is also provided. 
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I. INTRODUCTION AND OBJECTIVES 


A. INTRODUCTION 

More companies are converting their computer facilities 
to a centralized system with remote data entry locations. 
This trend tends eS create very large data bases. Thus the 
demand for a quality mass storage system continues to increase. 
As companies continue to centralize, the number of instal- 
lations with a mass storage system is definitely going to 
increase in the near future. 

The TBM* Mass Storage System was selected as a research 
topic for several reasons: 


1. The system is past the theoretical and development 
Stages and into marketing and production stages. 


2. Much more information, especially technical manuals 
and sales literature, was available. 


3. The base location for the TBM* Memory System is a 
division of Ampex Corporation in Sunnyvale, Cali- 
fornia, which allowed many trips for interviews. 

The Navy has many large data base applications, both as disc 
and tape libraries, which are reasonable candidates for a mass 
memory system. This research on performance characteristics 
of the TBM* Memory System should be of use to the Navy in the 
near future, and the knowledge and experience gained during 


the research should be of value in the writer's future shore 


assignments as a Naval officer. 


B. OBJECTIVES 
The TBM* Memory System apparently is a cost efficient 


method to store a large data base and still have the 
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effective access time of disc. Each installation has their 
own unique storage and performance characteristics. 
The primary objectives are: 


BS To vgive both sa brief amd an in-depth description of 
the TBM* Memory System. 


2. To verify the hypothesis in the above paragraph. 

3. To analyze performance characteristics of the TBM* 
System under varying system configurations, host 
loading factors, data arrangement criteria, etc. 

4. To define applications where mass storage memory 
systems are practical, in particular, applications 
in which the TBM* Memory System could be used. 

After reading the section entitled "The TBM* Memory System," 
the reader should understand the basic features and advantages 
of the TBM* System. After reading the section entitled "The 
TBM* Memory System Model," the reader should know how the 

TBM* Memory System operates and how the model represents the 
system. After reading the analyses, concluSions and recommen- 
dations, the reader should be able to infer what configuration 
of the TBM* Memory System would best suit his purposes. The 
appendices give the reader a more detailed understanding of 


various sections of the TBM* System, its uses, and other 


points of interest not included in the main body of the thesis. 





i elie ee EMORY SYSTEM 


The TBM* Memory System SeeMeSeture ana technology re- 
flect a significant departure from conventional approaches 
to the problems of data storage and retrieval. Data is 
stored on magnetic tape, but this is where the similarity 
to conventional tape oriented devices ends. TBM* is a 
massive digital memory system with a maximum capacity equiv- 
alent to that of 30,000 computer tapes (800 bpi). TBM* is a 
SYSTEM; it is self-contained and capable of operating inde- 
pendently of other processors. It can be used in a stand- 
alone configuration or it can be interfaced to commercially 
avallable computers to act as an auxiliary memory system to 
the host computer. 

TBM's unique system architecture provides such features 
as rapid random access to any record, extremely high data 
throughput (more than four times that of an IBM 2314 disc 
drive) and many more features, at a cost low enough to make 
the TBM the economical solution to massive data storage and 
retrieval problems. 

TBM* utilizes standard two inch wide magnetic video tape 
of the type used in commercial television. Data is recorded 
in a block format with each individual block identified by a 
unique address. Rapid random access to any record is achieved 
by performing block address searches at tape speeds of 1000 
ips - both forward and backward. Because of the high search 


speed and because of the high packing density, this corresponds 
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to searching through the equivalent of 10 conventional com- 
pueer tapes per Second. The system allows up to six simul- 
taneous tape searches - the equivalent of searching 60 
computer tapes per second. 

Functionally, the system can be divided into two parts: 
the Data Storage Section, DSS, and the Communication and 
Control Section, ccS. The DSS consists of one or more of 
each of the three basic building blocks: Dual Transport 
Modules, transport drives and data channels. A system must 
have one each of the basic building blocks, while more units 
can be added to provide desired system capacity and through- 
put characteristics for each application. The CCS contains 
the TBM Supervisory Control Processor, SCP, External Data 
Channel Processors, EDCP, Staging discs and channel switches. 
The SCP is the control mechanism for the TBM system. It also 
handles host computer I/O requests, controls interface buffers, 
staging discs and switches, and monitors the overall system 


status. 


A. ADVANTAGES OF THE TBM* MEMORY SYSTEM 
iP: Data Base Consolidation 
TBM is the economical approach to consolidation of a 
data base into a single integrated and meaningful structure. 
It can store up to 400 billion bytes on line - the equivalent 
Of 8.2 terion bits. In ordér to judge the ré@ative size of 
a trillion bits of data, the following comparisons are given. 


A trillion bits is equivalent to: 
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2,900 1600 bpi fully recorded 2400' tapes, or 
35,5010 2314 disc packs, or 
1,000 3330 disc packs, or 
950 Fastrand II drums, or 
250 2321 data cells (400 Mbyte Units) 
2. File Maintenance 
TBM's approach to data handling enables users to 
access, process and update files faster and more frequently 
than with conventional techniques. With TBM's access methods 
and buffering techniques, only the desired records need be 
sent to the host computer for processing. (See Appendix E). 
3. Real Time Applications 
A multilevel priority processing structure, together 
with concurrent random access to multiple files make TBM an 
element of on-line real-time data communication system. 
(See Appendix F). 
4. Scheduling Flexibility 
TBM can facilitate job scheduling. There is no need 
to worry about retrieving and mounting tapes or disc packs - 
with TBM the data is available on-line. 
5. Data Security 
Protection eos accidental overwriting is provided 
through read-only interlocks. File access protection is imple- 
mented via user codes associated with individual blocks or 
entire files. Backup data protection is provided by simul- 
taneously writing data from a single EDCP through data channels 


to two topplates (tape mounting units). TBM tapes can also 


be copied independently of the host computer for further backup. 


12 





The TBM* Memory System provides an economical means 
of keeping several copies as well as several generations of 
files on-line. A Single TBM tape is also an off-line storage 
unit containing the equivalent of 500 computer tapes at 800 
bpi and fits in a space the size of a three inch binder. 
6. System Reliability 
High reliability is achieved through extensive com- 
ponent redundancy, multipath access and dynamic TBM device 
reconfiguration. See Appendix B for a more in depth expla- 
nation of data reliability and system reliability. 
7. Off-Line Operations 
System components may be placed in the off-line mode 
if disconnected from the host computer. The off-line mode 
1s used for performing system diagnostics to locate problems, 
for preaddressing and testing tapes for routine maintenance, 
and for file duplication for backup protection. These 
Operations are performed with a special off-line language and 
do not require the host CPU, the staging discs or the EDCP 


(External Data Channel Processor). 


B. MODULARITY OF THE TBM SYSTEM 

From the basic system, Figure 1, containing one each of 
the building blocks, the system can be expanded both in 
Capacity and performance. Up to thirty-one Dual Transport 
Modules can be added to provide on-line storage equivalent to 
more than thirty thousand standard computer tapes. To provide 


for many Simultaneous activities, up to six Transport Drivers 
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can be added. These make possible up to six concurrent 
random accesses (Figure 2). With added Data Channels and 
another External Data Channel Processor, two simultaneous 
read/write commands may be executed, yielding 1.5 megabytes 
per second data throughput. (Note: The original system 
proposed by Ampex was Shee a throughput of 4.5 megabytes 
per second, but this was later limited to 1.5 megabytes due 
to the capacity of selected computer for the EDCP.) Switch- 
ing matrices permit any Transport Driver and Data Channel to 
access any topplate. As the system expands, so do the switch- 
ing matrices. This approach allows multipath access, dynamic 
reconfiguration, and optimum device scheduling, which assures 
a continuous flow of data, even when part of the system is 
taken off-line for maintenance. Figure 2 illustrates the 


maximum expansion of the TBM* Memory System. 


Much of the above information was derived from Ampex's 


technical manuals, References 1, 2, and 3. 
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Ric. “LEM SMEMORY SYSTEM MODEL 


A. OPERATIONAL MODEL 

The Model parallels the management's view of the TBM* 
Memory System (see Figure 3) and yet incorporates many engi- 
neering technicalities. This section describes the develop- 
ment of the TBM model. The System was first examined from 
the descriptions in Ampex's sales and technical literature. 
Several trips to Ampex for day-long interviews revealed that 
the initial proposed system and the adopted system were two 
different things. By combining technical information from 
the manuals and the new or updated information from the 
interviews, an initial logical flow chart of the system was 
made. This was presented to one representative of the manage- 
ment of the TBM* Memory System research department and one of 
the development engineers. After many hours of clarifications, 
the model was approved by Ampex as accurate to the degree 
necessary for simulation and sensitivity analysis. The 
approved detailed model (see Figure 4) was similar to and 
eWeee core used in place of a program flow chart for developing 
the simulation model. 

The next step was to program the detailed model ina 
Simulation language. In addition to the flow chart, the 
following variables, with values suggested by Ampex, were 


used in the development of the simulation model. 
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Figure 3 
OVERVIEW OF THE OPERATION OF THE TBM* MEMORY SYSTEM 


dig 


Host sends data set requests to 


.TBM queue by job priority. 


SCP looks up address, builds 
each topplate's search queue by 
data set location on tape; host 
puts jobs on wait queue. 


TBM Searches tape and prepares 
fOr wead. | 


Read from tape through data 
channel and EDCP to disc. 


Host processes jobs. 


Host outputs write jobs to 
queue and signals SCP. 


e 


“SCP searches and prepares for 


write. 


Write from disc through EDCP 
through data channel to tape. 
Verify and tally. 
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Figure 4 
LOGICAL FLOW CHART OF TBM* MEMORY SYSTEM 
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located on a device within 
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Assign device type. 

Send data set requests to 
SCP. 
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status on host queue. 
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Arrange by priority. 
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Send a message to mount 
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Figure 4 (Continued) 


EXPLANATION OF STEPS THROUGH THE TBM* MEMORY SYSTEM 


Jobs are received by the host with the following parameters: 
A. Interarrival mean and distribution. 

B. Size of each data set request. 

C. Number of data sets per job. 

me frvority of job. 


Is the data set located on any device within the host's 
system? 


If the data set 1s resident in the host system, then the 
host accesses the data set. 


If the data set is not in the host's system, then the 
host signals the TBM SCP over a control link (similar 
to having a write-to-operator request on the console 

to have a certain tape mounted). The data set requests 
and job priority are sent to the SCP. 


The host puts the user program on an inactive status 
and does other procesSing. 


SCP accepts data set requests and queues them by 
priority. The SCP builds the queue of requests until: 


A. Host finishes read/initiate. 
B. Queue reaches maximum number. 


C. Maximum time limit exceeded by a request. 


SCP sends requests with highest priority to the file 


girecconrny first. 

SCP searches the master file directory which contains 
the status of all data set recorder in the TBM* memory 
system. 

Is the requested data set on line? 

If the TBM tape has been taken off line, then send a 


message to the operator to retrieve the tape and put it 
on line. 
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Figure 4 (Continued) 


If the tape is on line, then the search continues until 
the data set discrimination block is retrieved. It 
includes the: 

A. Tape volume serial number. 


B. Disc space required. 


All the requests for each topplate are queued by 
ascending order of addresses. 


The SCP selects a topplate with a queue: 

A. Of length N. 

B. Over maximum time limit. 

C. Containing the highest priority requests. 


Also, the SCP sends to the host the disc space required 
to contain the data set (multiples of TBM blocks). 


Is sufficient disc space available? 


If it is not available, then TBM cannot initiate a read. 
Disc space will be released by the host in the case of 

read-only data sets and, if necessary, TBM can initiate 
a write of the write queue which would free disc space. 


Host sends to SCP the location where to put the data on 
disc. SCP must be ready to have the channel simulator 
modify the home address of the data during the data 
transfer. 


ms che) tepplate busy with previous requests? 


If the topplate is busy, the requests for that topplate 


Wait, but parallel processing of other topplates 
continues. 


If the topplate is not busy, the SCP assigns the first 
available transport drive. 


If all transport drives are busy, then wait until one is 
released. 


Connect a transport drive and correct topplate. (This 


can take from 0.5 to 1.9 seconds depending upon the 
Sacuacion. ) 
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Search and preposition 
tape for read. 


Transport drive signals 
SCP ready to read. 


Are all data channels 
busy? 


Transport drive and 


topplate stay connected 
and wait for data channel. 


SCP does other procesSing. 


SCP connects transport 
drive and topplate to 
data channel. 


Is disc controller busy? 


Wait until it is available. 


Read from tape through 
data channel through EDCP 
tomas. 


SCP to host data available 
SCP disconnects TD, EDCP, 
DC and topplate. 


Host processes data. 
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Figure 4 (Continued) 


Search begins at the lowest address of the queued 
requests. Once the address is found, then the trans- 
port drive pre-positions the tape in preparation for 
the read. 


The transport drive sends a signal to the SCP when the 
pre-positioning is complete. 


The transport drive and topplate logically become one 
unit and together they request a data channel. 


If a data channel is not available, then the transport 
drive and data channel wait. 


SCP does parallel processing while waiting for a data 
channel to signal its availability. 


SCP connects the transport drive and topplate to an 
available data channel. 


Is the disc controller busy? 
If not, wait until it iS available. 


Initiate the read. The data travels from the topplate 
maArough ;: 


A. The data channel for error detection and correction 
and bit conversion, 

B. External data channel processor which acts as a 
buffer to compensate for the different transfer 
rates of disc and TBM* memory system, 


C. Channel simulator to change the disc home address, 
if necessary, 


D. To disc in the space designated by the host. 

SCP signals the host that the data is now available on 
disc at a deSignated area. The SCP disconnects the 
transport drive, the external data channel processor, 
the data channel, and the topplate. 


The host processes the data. The host processing time is 
user dependent. This is not of concern to TBM except: 


A. Write to TBM. 


B. Filling of disc space. 
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, Figure 4 (Continued) 


33. Host finished processing 
OS outputs data to disc. 


34. Was request for read only? 








Yes | 
Be 35. Release disc space. 
No 
36. Queue writes on disc by 
36 tape location. 
37. Is more disc space needed? 
38. Hold write queue until 
inactive time, then write. 
Yes 


39. Force a write of queue 
onto a write tape. Verify 
and update tally. 


40. Release disc space. 





41. Hold write tape until in- 
active time, then write to 
,correct Becation frem write 
tape. Note: If data set is 
needed before update from 
write tape, a search of 
write tape at 83 IPS is 
possible. 
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Figure 4 (Continued) 


Host finishes processing jobs and writes to disc, if 
appropriate. 


Is the request for read only? 
The host releases the disc space. 


If writing to TBM*, the queue arranges all write requests 
on disc and either writes them now or later depending 
upon management's policy. 


If more disc space is needed even after releaSing read 
only space, then initiate a write. 


If the disc space is not needed just now and the proc- 
essing load is large, then wait for an inactive time of 
the day to write the data from disc to TBM*, 


If a write is necessary to free disc space, then a 
faster method for clearing the disc is to write to a 
dump tape rather than updating the original files during 
a busy period of the day. 


Release the disc space after the write and verify is 
completed. 


Hold the write tape until an inactive period of the day, 
then update the TBM* master tapes without tying up host, 
discs, or EDCP. 


= 


Note: If an updated data set is needed from the write 
tape, the host should keep a write tape file directory. 
Then a search in the usual manner could be made. This 
is also useful if the write dump tape is to be used as 
a backup for security. 
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Host Controlled Variables 


1) 


2) 


3) 


4) 


3) 


6) 


7) 


1) 


2) 


3) 


Job Arrival Rate 


Jeb Arrival 
Destributiorw 


Job Priority 


Number of Data 
Sets/Job 


Size of Each 
Data Set Request 


Number of Disc 
Controllers 


Number of Disc 
Drives 


Number of Trans- 
port Drives 
Available 


Initial Tape 
head location 


Batch 


Ave 80/hr, range 60-120 hr; 
CP (time sharing)-ave. 20/hr; 
Initial sign on - 20/hr; 
Range 0 - 25/hr 


Poisson - with above average; 
constant - (1.e., 1 job/min); 
Immediate - 20 jobs arrive 
together 


The priority ofea job only 
influences its relative 
position when the job is 
released into the TBM* System; 
after being released into the 
system the job's priority has 
no effect. 


CPa= 2. .Bateh - 2,5, 8. If 

5 or 8 data sets are requested 
then usually 2 of them are very 
large. 


CP - Average 512 K bytes. 

Batch - Average 2.5 megabytes; 
Range 128 K bytes - 30 megabytes; 
See Figure 5. 


Single Access - 1 controller 
Dual Access - 2 controllers 


Host dependent 


TBM* Controlled Variables 


Minimum System of 1; 
Maximum System of 6 


' Mid tape; 


Beginning of tape; 

Random location. 

The initial bias was eliminated 
before gathering statistics 

on each run. 


Search speed is dependent upon search distance. 
See following section on search speed and times. 
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4) 


>) 


6) 


7) 


8) 


9) 


10) 


11) 


Number of Data Minimum System - 1 
Channels Maximum System - 3 
Number tested - l, 2 


Number of Super- Minimum System - 1 


vols@ry.. Coniteso] Maximum System - 2 
Processors Number tested - l 
Number of Dual Minimum System - 1 
Transport Maximum System - 36 
Modules Number tested = 1 - 36 
Per cent of Read- Any percentage possible. 
only Requests Tested 75% 

Time for File Average 0.1 seconds. 


Directory Search Range 8.5 - 1000 milliseconds. 


Time for search and preposition of the tape is 
dependent upon distances to be searched. See 
following section on search speeds and times. 


Job priority queuing techniques before 
releasing to TBM System. 


ay BWengehmot quétc. 
b) Time spend in queue. 


Location of Data Sets on Tape: 


a) Random - Assume all types of data sets are 
located uniformly over the whole tape. 

b) Size - Arrange data sets by size. The larger 
the data set, the closer to either end of the 
tape. 

c) Activity - the more active the data set, the 
closer to the middle of the tape. 

ad) Size/Activity - A combination of the above 
arrangement. 


3. Write Options 


1) 


2) 


Replacement of Write Updates 


a) Into final destination. 
b) Onto Write Tape. 


Release time of write updates. 
a) Queue Write until 
1) Disc limitation force write 


2) Slow period of day 


Pas 





b) Send to TBM System as released by host. 


B. SIMULATION MODEL 
1. Programming Language 

The model is written in General Purpose Simulation 
System language (GPSS). The complete program is in Appendix 
I. The model is programmed to include the engineering tech- 
nicalities while following the management's operational flow 
chart as described previously in Figure 4. GPSS is an event- 
Oriented discrete simulation language. All the model's events 
are paralleled by GPSS blocks. To make the model more 
accurate, the time units of the model were tenths of seconds. 

2. Search Speeds : 

The search time for locating a particular TBM block 
is based on the following assumptions. If the topplate is 
waiting for the search signal, then the tape is not moving. 
When the signal to search at 1000 IPS arrives, there is no 
delay waiting for DTM (Dual Transport Module) to connect to 
the TD (transport drive) since that operation was done 
previously. The acceleration to 1000 IPS begins and for 
eieeases of the Simulation is assumed to’ be constant. The 
time required to reach 1000 IPS is 1.2 seconds. The error 
introduced into the model by the assumption of constant 
acceleration is trivial when compared to search times. The 
statistics for deceleration are the same SS eee Or 


acceleration. 


30 
















— <« =7v ' eo a @ Oo => & © 

———— —_ 

9 Se | GE ates 

—_ oe) esi 6 

eo wee eee & wae 

ss = <= hil” i,» _- 
S—- > SD aa aa 

aaa ll ti aii a 
ao. — _—————-> = 

- ————_ & = > ———_— m= 

~_Y~ =— => | a2. Ca aaa 

aa-= § Gams «€ «  @ & 

® =— Oe 4 yy ae —a « 

”- @@D em ae >> aa” ™- = 


= . 





a. Mathematics of Search Time 
A constant acceleration from 0 to 1000 IPS, or 
deceleration from 1000 IPS to 0, in 1.2 seconds means that 


600 inches were covered at an average rate of 500 IPS. 


Average Velocity 1000 IPS = 2 = 500 IPS; 


Distance Covered = 1.2 seconds X 500 inches = 600 inches. 
seconds 


Since 1 TBM Block = 0.9752 inch or 

l inch = 1.02543 TBM Blocks, 

Distance Covered = 600 inches X 1.02543 Blocks = 

Inen 

615.26 Blocks 
So 615 blocks are travelled in reaching 1000 IPS or de- 
celerating from 1000 IPS to stop. Every search has an 
acceleration period and a deceleration period. If the search 
speed reached 1000 IPS, then 1230 blocks were covered at 
some speed less than 1000 IPS during acceleration and de- 
celeration periods. | 

Since acceleration is constant, it can be computed: 


* 


ACC = 1000 IPS = §33.33 IPSs? 
1.2 seconds 


The maximum velocity attainable is 1000 IPS. If 
the search distance is less than 1230 blocks, then 1000 IPS 
will not be reached and the search time is calculated as in 
the following example. 

Example: Search distance = 1000 blocks. 


Accelerate for 500 blocks and decelerate for 500 blocks. 
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Maximum velocity will be less than 1000 IPS and is computed 
as follows: 

Velocity Maximum = Constant Acceleration Rate X Time of 
Acceleration 

The tape is accelerated for 1/2 of the total time of travel 
and decelerated the other half. (There was no period of 
constant velocity since 1000 IPS was not reached.) 


Velocity Maximum = Acc. X time 
2 


833.33 X time 
Zz 


The distance travelled equals 1000 blocks, 
according to this example. The distance travelled is also 
equal to average velocity X time of travel. Average velocity 
equals velocity maximum = 2: (Note: This is not true if 
1000 IPS attained.) 

Vaverage ¥ Vinax 7 2. 


Substituting and solving for time gives: 


; Acc X m2 


; [4p 4D D< 1230 
T = VACC = Vaaiaiooe 


therefore, when search distance is less than 1230 blocks, the 


D 


time will be computed from the above equation. 

When the distance is greater than 1230 blocks, 
then 1000 IPS will be reached. The acceleration time plus 
deceleration time equals 2.4 seconds and the distance 


covered is 1230 blocks. Any search distance greater than 
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1230 blocks will have an acceleration period of 1.2 seconds, 

a deceleration period of 1.2 seconds, and a constant velocity 

period determined by search distance, D. (See Figure 6). 
Time to search greater than 1230 blocks is 

given by: 


Time = 2.4 + D= 1230 , D 3 1230 Blocks 
1000 


Note: The error induced by the assumption of constant 
acceleration and deceleration is trivial when compared 
to search times. Appendix C has an explanation of search 
speeds. 
3. Major Time Periods 
The simulation model is built for a typical case 
borrowed from Ampex* under peak loads. Any special situation 
can be included with minimal changes but attempting all 
special situations is infeasible. The major time periods 
of the model are as follows: 
1. Jobs requesting data sets enter TBM* and wait until: 
A. Enough jobs have arrived; 
B. Some job has been in queue a maximum time limit. 
While waiting, jobs or data set requests are 
arranged in a queue by priority. Each job has 


one or more data set requests. 


rr. Transactions or data set requests enter queue for a 
particular topplate and wait until: 


A. Enough requests for that topplate exist (1i.e., 10); 
B. Some request has been in over maximum time limit. 
et Transactions for a topplate enter a queue waiting for 


a transport drive. After being allocated a transport 
drive, there is a short delay for connections. 
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IV. 


vel . 


Ven ae 


ertt. 


Tz. 


Transactions for topplate sequentially advance 
through Steps V to VII below. 


Search time for each transaction assumes constant 
acceleration and deceleration (see Chapter III-B-2 
above). 


A. Distance < 1230 TBM blocks, T =./ 4D = _/ 4D 
ACC 833 


B. Distance S&S 1230 TBM blocks, 


T= #£ Of "blocks - 1230 + 274 
1000 


When each transaction is found then a Data Channel, 
an External Data Channel Processor, and a disc 
controller must be obtained. Note: the disc space 
Should have already been allocated. The devices are 
released after the reading operation, except the 
transport drive is not released. 


Read time for each request is composed of two constants 
A. Preposition time, 250 milliseconds; 

B. Tally update time, 200 milliseconds. 

and a variable dependent upon data set size, i.e., 

0.19 sec./TBM block. (Note: the above three steps 

are repeated for all transactions waiting for that 


topplate.) 


Host processing time is not of concern to TBM* Memory 
System except for: 


A. Write entries into TBM* (for rewriting updated 
records); 


B. Disc space available. 


Write time affects system only if writes are necessary 
during busy periods. This would happen only 1f: 


A. Management wanted files updated immediately; 

B. Not enough disc space is available to hold write 
queue. It is practical to put all writes ona 
temporary write tape for later updating of the 
master file. 


Updating speeds are dependent upon number of blocks to 
be updated: 
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A. Erase: 5 or 83 IPS; 
By... ealktty ; a3 eS: 
C. Verify 5 IPs, 


Note: Appendix C has an in depth explanation of 
hardware time periods. 
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IV. ANALYSIS 


or olteM RESPONSE TIME VERSUS TRANSPORT DRIVES 

This section describes the average system response time 
and how it varies with number of transport drives. First, 

a perceptive conclusion is derived from Figure 7. The 
average system response time varies inversely with the nunm- 
ber of transport drives available, and directly with the 
input rate. The next step is to find out where and why 
serious delays are encountered in the system. In examining 
the delays, the following variables are held constant. 1) 
Data sets were uniformly arranged over the tape with no bias 
as to size or activity. 2) The queue minimum level before a 
search was initiated was one. This tested the demand on the 
transport drives as the requests arrived. If more requests 
arrived after the first one and before a transport drive 
could be secured, then the requests were arranged in sequen- 
tial order by physical location. The sequential ordering 
reduces the average tape search time and therefore the 
average system response time. 

The first section will find the delay caused by the non- 
availability of transport drives. Figure 8 illustrates the 
average time spent waiting for a transport drive. The 
Similarity of the average system response time in Figures 7 
and 8 shows that the system delay is largely due to lack of 
transport drives. Figure 9 illustrates that as number of 


transport drives decreases, the percentage of total time 
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MODEL OVERG@AD* . —= —— ee fe 


4 
Input Rates 

Average (Data Set Requests 
System per Hour) 
Response 
Time 3 
(Minutes) 

2 

- 





3 2 1 


NUMBER OF TRANSPORT DRIVES 


*Model Overflow means requests were arriving faster than 
they could be processed - Result: Large backlogs of jobs 
that could not be handled by simulation program. 


Figure 7 AVERAGE SYSTEM RESPONSE TIME 
VS. NUMBER OF TRANSPORT DRIVES 
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MODEL OVERFLOW —$>—s——— _—| — = 


4 
Average Imput Rate 
Time (Data Set Requests 
Spent per Hour) 
Waiting for 
Transport 600 450 300 
Drive =) 
(Minutes) 
2 
1 


4 3 2 at 


NUMBER OF TRANSPORT DRIVES 


The Average Anount spent waiting for a transport drive 
varies inversely with number of TD available and directly 


with input rate. 


Figure 8 AVERAGE TIME SPEND WAITING FOR A TRANSPORT DRIVE 


a 





INPUT RATE (DATA SETS REQUESTS/HR) 


300, 450, 600 


> 


Percentage of 
Total Time 
That a Request 
Is Blocked 
Because No 50 
Transport “° 
Drives Were 
Available 


2) 


4 3 2 1 
NUMBER OF TRANSPORT DRIVERS 


*Model Overflow 
This is the percentage of total time that any request 


was blocked because of non-availability of a transport drive 
and not the utilization of each transport drive. 


Figure 9 PERCENTAGE OF TOTAL TIME THAT A DELAY WAS DUE TO 
NON-AVAILABILITY OF A TRANSPORT DRIVE 
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that at least one request iS waiting for a transport drive 
increases. This naturally pushes the average time delay 
due to non-availability of transport drives up. The higher 
the utilization of all transport drives, the greater the 
average system response time. The statistics in Figure 9 
are the percentages of total time that a request was wait- 
ing for a transport drive. As average delay for transport 
drives increases, then more jobs arrive and enter the queue. 
Naturally the average number of requests per topplate in- 
creases as illustrated in Figure 10. The average number of 
requests waiting for a topplate increases because more 
requests arrive before a transport drive can be secured. 
The average number of requests for a topplate varies in- 
versely with number of transport drivers available and 


directly with input rate also illustrated in Figure 10. 
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[ move OVERFLOW — — “fF ee {7 a 


Input Rate 
Average (Data Set Requests 600 450 300 
Number of per Hour) 
Requests 
Waiting per 
Topplate 
S 
2 
1 
4 3 2 de 


NUMBER OF TRANSPORT DRIVES 


The minimum topplate queue length before a search was 
initiated is l. 


Figure 10 AVERAGE NUMBER OF REQUESTS PER TOPPLATE 
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B. SYSTEM RESPONSE VERSUS QUEUE LENGTH 

This section will attempt to find a logical minimum 
queue length for topplates. A queue is defined for each 
meeerace to accept all requests for that topplate. As re- 
quests arrive they are arranged sequentially by the data set's 
physical location on the tape. The sequential ordering re- 
duces the average tape search time and therefore the average 
system response time. 

When the minimum queue level is set to greater than one, 
then that topplate does not request a transport drive until 
enough requests are in the queue. Even when the topplate 
does request a transport drive, it has to wait until one is 
available and during that wait, more requests could enter 
the queue and be put in their sequential location. In 
other words, the number of requests batched for each search 
may be greater than the minimum queue level but never less 
than it. 

The optimal minimum queue length would be the length 
that minimized the average search time per request. Al- 
though the logge minimum queue length as presented later 
in this thesis may be optimal for some users, in general 
the optimal will have to be determined by the user for his 
particular installation. This analysis provides a first 
approximation for determining the optimal minimum queue 
length. Referring to Figure 11 will be of assistance in 


the following analysis. The goal is to try to minimize 
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average search time per request after the batch of requests 
has been released and the searching has been initiated. 
In order for the batch to be released for searching, a 
transport drive must be secured. Consequently, the number 
of transport drives has no effect on the average search 
time per request. The minimum search time is achieved when 
the queue is of a length between 10 and 15 requests. The 
data set size is held constant. Consequently the read time 
is constant for all queue lengths and only the search time 
varies with queue length. (See Figure 1l). The average 
system response time is also dependent upon three other 
hactors: 

1) Number of transport drives 

2) Topplate queue length 

3) Input rate. 

By holding the input rate constant and varying the other 
two main factors, a average system response time is 
sensitive to both the number of transport drives and the 
topplate queue length, as Shown in Figure 12. The linear 
increase in average response time is due to waiting for the 
arrival of enough neGueses to ell thesqueve to the minamum 
queue level for the eeebiite. This conclusion is supported 
by Figure 13 which shows the average time that a data set 
request is held in the queue before being released for 
searching and processing. 

This conclusion is valid except when the number of 


transport drives becomes a critical factor in system re- 


Sponse time. In the case of one or two transport drives, 


44 





the average time in the system increases drastically. 
This is due to the average time spent waiting for enough 
requests to fill the queue for the topplate, as illustrated 


by Figure 14. 
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20 


Average 2 

Time a 

for Each 

Search Plus 4 Number of Tape Drives 
5 

Read 


(in Seconds) 


10 





1 bs) 10 eS 20 


MINIMUM TOPPLATE QUEUE LENGTH 


Constant variables input 450 data set requests/per hour 


Uniform distribution of data sets on tape 


The read time is constant for all queue lengths so only the 
Search time varies. 


Figure ll AVERAGE TIME TO FIND AND READ EACH DATA SET REQUEST 
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Number of 


Transport 
Drives 


Average 
System 
Response 
Time 
(Minutes) 





It 5 


10 15 20 
MINIMUM TOPPLATE QUEUE LENGTH 


Most of time is spend waiting for enough requests to arrive. 


*With 1 transport drive the model overflowed. 


Figure 12 AVERAGE SYSTEM RESPONSE TIME 


VS. MINIMUM TOPPLATE QUEUE LENGTH 
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Average 

Time in 

Queue before 
Being 

Released 3 
to be 

Searched 
(Minutes) 


Transport 
Drives 


ih 5 10 > 20 
MINIMUM TOPPLATE QUEUE LENGTH 


Delay in queue when only 2 transport drives are available 
is due to the non-availability of transport drives. 


Delay in queue when 3 or 4 transport drives are avaialble 
is due to wait for enough jobs to arrive. 


Figure 13 AVERAGE TIME IN QUEUE BEFORE BEING RELEASED 
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NUMBER OF TRANSPORT DRIVES 


0.79 
Percentage 
of Total Time 
That a Request 
Is Blocked 
Because No 
Transport 0.5 
Drive Is 
Available 


i 3 eo 20) 15 20 
MINIMUM TOPPLATE QUEUE LENGTH 


Note: This is percentage of total time that any request was 
blocked due to the non-availability of a transport 
drive and not the % of time a TD is busy. 


Figure 14 PERCENTAGE OF TIME A REQUEST IS BLOCKED DUE TO 
NON-AVAILABILITY OF A TRANSPORT DRIVE 
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C. SYSTEM RESPONSE TIME VERSUS DATA SET ARRANGEMENT CRITERIA 

The file management policies can affect the system 
response time. A careful analysis of three different methods 
or data set arrangement on TBM Tape will be described in the 
following paragraphs. These three arrangement methods are 
uniform (or sequential), by activity, and by data set size. 
Since the data set arrangements are mathematically formulated, 
no claim is made that they represent any one particular in- 
stallation and the analysis should be used only as an 
indication of many possible results. 

1. Uniform Arrangement 

The data sets are placed in Sequential order on a 

first-come first-located basis. Consequently all sections 
of tape have equal probabilities of being referenced. 

2. Activity Arrangement 

The data set's location on tape is determined by its 

activity over the past time period. Every time a dia Soe 
is referenced, the tally track is also updated. After 
several time periods, the data sets on the tape would be 
Peievancea according to activity during those time periods. 
The more active the data sets are, the closer they are to 
the middle of the tape, (See Figure 15). The idea is to put 
the most frequently accessed data in a location where they 
can be accessed most quickly. This idea is very appealing 
to installations that follow the adage - "90% of the accesses 


are to 15% of the data base." 
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The principle advantages of the arrangement by 
Aetivity 1s: 

1) The reduction in the average search time; 

2) Data sets may be rearranged independent of the 
host computer, in Slack times. 

The principal disadvantages are: 

1) The necessary bookkeeping in order to arrange by 
Sec avairty . 

2) The need to re~order the data sets on a tape 
perledacally. 

3) The possible placement of a very large data set 
in the middle of a tape which means greater 
search times to many other data sets. This 
result may cancel and reverse any savings gained 
by activity arrangement. 

3. Size Arrangements 

The data set's location on tape is determined by its 

Size. The larger the data set, the farther it is located 
from the middle of the tape. Thus, small data sets are 
located towards the middle of the tape -- See Figure 16. 
The logic behind this arrangement is based on activity over 
a certain section of tape. Assume the data sets have a 
uniform hit ratio. The section of tape containing the 
smaller data sets will have a higher hit ratio per length 
Be tape. The idea of putting the highest hit section of 
tape in the middle of the tape is to minimize the search 
times and thereby reduce the average access time. The major 
problem with this idea is the assumption of a uniform hit 
ratio on all data sets. 

4. S1ze and Activity Arrangement 


The purpose of combining activity and size criteria 


to arrange data sets is to avoid putting a large data set in 
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the center of a tape. In the proposed and tested system, 
the probability of a large data set having a high hit ratio 
is so small that the statistical effect to the average 
search time is insignificant. Consequently this combination — 
arrangement was not simulated by the model. 
5. Analysis 

The arrangement of the data sets on tape by activity 
or size iS an attempt to reduce the average search time. 
Figure 17 illustrates how average search time varies with 
the type of data set arrangement as well as the number of 
transport drives. When four transport drives are available 
and the delay in obtaining a drive is low, then a significant 
difference occurs between uniform arrangement of data sets 
and size or activity arrangements. As the number of trans- 
port drives decreases and the delay before obtaining one in- 
crease, the differences in search time decreases. The entire 
delay allows the queue to build for each topplate. The data 
set requests are arranged on the queue by increasing 
location on the tape. Consequently the average search time 
decreases as queues mee longer until queue reaches length 
of 15-20. (See Figure 11.) An input rate of 90 jobs/hour 
puts a sufficiently heavy load on the model so that when 
constrained by only three transport drives, the advantage 
of data set arrangement is reduced, but not eliminated. The 
average search time reduced for all three aeeangenenes as 
the queue length increased, but there is no assurance the 


queue length increased proportionally for all three 
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Average Time Data Set 
To Search and Arrangement 
Read Each Criteria 
Request 20 
(Seconds) 
—— Uniform 
ACtivaty 
——— "Suze 
15 
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4 3 Ba 


NUMBER OF TRANSPORT DRIVES 


Constant: Minimum Queue Length 1 
Input Rate 90 Jobs/Hour 


The read time for each case iS constant so only the search 
time varies. 


Figure 17 AVERAGE TIME TO FIND AND READ EACH REQUEST 
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arrangements. The crossing of the activity and size curves 
is not considered significant. Since the search time did 
not give a clear answer the next best idea is to check the 
system response time. Figure 18 illustrates the average re- 
sponse time versus the number of transport drives. The 
advantage of data set arrangements is best illustrated in 
this graph. The average system response time is significantly 
lower for activity arrangements in all cases shown. Remember 
that these times are estimates obtained by simulation of a 
typical installation and they do not necessarily represent 
any one particular installation. 

An analysis of system response time versus topplate 
queue length is illustrated in Figure 19. The results in- 
dicate that with four transport drives that the average 
system response time is almost linearly increasing with 
increaSing topplate queue length. The linearity of the in- 
crease indicates most of the increase is due to the system 
waiting for enough topplate requests to arrive to initiate 
the search. 

6. Hardward Utilization 

The utilization of the entire TBM System hardware 
tends to depend on the utilization of the critical components 
of the system, the transport drive and the External Data 
Channel Processor. These two devices affect the usage of 
the other devices significantly. 

In the first situation where a large amount of data 


is to be read (or even worse, updated in place) without 
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4A 3 2 


NUMBER OF TRANSPORT DRIVERS 


Input Rate 90 Jobs/Hour 


*Model Overflow - the time is an average of all transactions 
completed up until simulation stopped. 


Figure 18 SYSTEM RESPONSE TIME VS. DATA SET 
ARRANGEMENT CRITERIA 
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Average 

System Uniform 
Response / / Size 
Time 5 te 
(Minutes) a ~ASELVITCY 


Data 
4 }- Set 
Arrangement 
Criteria 





dk 5 LO igs 
MINIMUM TOPPLATE QUEUE LENGTH 


Constant: Input Rate 90 Jobs/Hour 
Transport Drivers 4 


Figure 19 AVERAGE SYSTEM RESPONSE TIME VS. TOPPLATE QUEUE 
LENGTH 
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involving much search time, the result is a backlog for and 
delay due to the EDCP. An example of this would occur when 
six transport drives search short distances and then read a 
large number of blocks. If this is the problem that a user 
encounters, then he probably should: 

1. Use another EDCP and increase his throughput or 


2. Decrease the number of transport drives since 
they would have poor utilization. 


More often the TBM* Memory System is transport 
drive bound. The average search time is much larger than 
the average read time. The EDCP may sit idle while all the 
transport drives are busy searching different topplates. If 
a user finds his system transport bound, then three major 
steps can be taken: 


1. Queue requests for each topplate in sequential 
order by location. 


2. Arrange his data sets on tape by some criteria, 
such as activity or data set size, in order to 
Shorten average search time. 

3. Use more transport drives. 

Figure 20 illustrates the changes in the hardware 
utilization as the oe of transport drives varies. The 
percentage shown for transport drives is the percentage of 
total time that a request was blocked due to non-availability 
of a transport drive. As requests were blocked at higher 


percentages of total time, then utilization of the topplates, 


the data channel, and the disc controller decreased. 
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INPUT RATE = 90 Jobs/Hour 
MINIMUM TOPPLATE QUEUE LENGTH = 1 


Average Data Disc Transport 
Topplate Channel Controller Drives* 









Ans Number of 
. Transport 
Drives 
Percentage 
of Total 4 
Time a Device US A US A USA U SA 
Is in Use ; 
Data Set Arrangement Criteria 
U - Uniform 
S - Data Set Size 
A - Activity 
1.0 
0.3 
3 
USA USA USA US A 
1.0 
0.5 
2 


USA USA US A USA 


*Percentage of total time that a request was blocked due to 
non-availability of a transport drive. 


Figure 20 HARDWARE UTILIZATION 
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V. CONCLUSIONS 


The TBM* Memory System is one of the leading contenders 
in the recently developed mass storage technology. Mass 
storage systems offer many advantages over existing tape and 
disk systems and would be effective in a wide range of 
applications. Some major applications for the TBM System are: 


1) To replace large conventional on-line and off-line 
tape library systems; 


2) As supplements to enlarge disc systems; 


3) To allow data to remain on-line that would usually 
be stored off-line. 


The main features of the TBM* Memory System are: 


1) Complete system redundancy. Any device of the 
system can be excluded by using another device 
like it, providing a duplicate is in the system 
Cenfiguration. 


a) In the case of equipment malfunction the whole 
system 1s not down waiting for one device to be 
repaired. 


b) It is possible to schedule maintenance downtime 
without sutting down the whole TBM System. If 
the downtime is scheduled during slow periods, 
then little, if any, delay is incurred in 
response time. 


2) Modular Expansion. The installation can buy for its 
needs at the present and expand a little at a time 
as the installation increases in size and the activity 
increases. 


3) Reliability. Due to equipment redundancy the system 
reliability 1s very good. Due to redundancy record- 
ing techniques the data reliability is also very 
good. 


4) Large Library On-Line. The TBM* Memory System pro- 


vides a feasible method of keeping a large data base 
on-line at all times. 
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3) 


6 ) 


Rapid response. A request to the TBM System can 
achieve varying response times depending upon the 
access techniques used. Due to a process of back- 
filling the host seldom sits idle waiting for the 
request to be answered. Backfilling is the process 
where the host puts a job on an inactive queue after 
sending data set request to the TBM* SCP. 


Low Cost Per Hit. The large capacity of the TBM 
System gives the user an economical means of stor- 
ing a large data base. Additionally the tape is 
reusable which reduces the costs. 


Based on the results of the simulation model, the 


following conclusions are made: 


i) 


2) 


a) 


a) 


3) 


6 ) 


7) 


The average system response time varies inversely 
with the number of transport drives and directly 
with the input rate. 


One minute average response time is possible with: 
a) 2,3, or 4 transport drives if the arrival rate is 
60 jobs per hour, b) 3 or 4 transport drives if the 
arrival rate is 90 jobs per hour, c) 4 transport 
drives if the arrival rate is 120 jobs per hour. 


feewet- depaval rate of 90.j7ebs per Nour the average 
system response time more than doubles for each 
transport drive less than four. 


At the arrival rate of 60 jobs per hour the decrease 
in average system response time due to adding more 
than 2 transport drives to the configuration is less 
than 20 seconds. (Response decreases from 50 seconds 
to 30 seconds.) 


The average time that each data set request spends 

waiting for transport drive increases as the number 
of transport drivers decreases and varies directly 

with the average system response time. 


If the arrival rate is 60 jobs per hour or greater 
and the data is located on a minimum of 5 topplates, 
then more than one transport drive is required to 
prevent a backlog of jobs indefinitely. 


At the arrival rate of 120 jobs per hour a minimum 
of 3 and preferably 4 transport drives are needed. 
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8 ) 


9) 


10) 


a) 


NZ") 


eS) 


14) 


As the arrival rate increases from 90 to 120 jobs 

per hour with only three transport drives available, 

the average number of requests waiting on a topplate 

queue increases from less than 1 to over 3. At times 
the queues get very long. If. 4 transport drives are 

available then the average queue length is less than 

1 for all the arrival rates tested. 


The number of transport drives has no effect on the 
average search time once the requests have been re- 
leased from the gueue to be Searched. 


The logical minimum topplate queue length was found 
to be between 10 and 15 requests. This minimizes 
the average search time per request, but not neces- 
Sarily the average system response time. (See 
numbers 11 and 12). 


The average system response time can actually be 
increased by setting the minimum topplate queue 
length between 10 and 15 due to the requests sitting 
in the queue for long periods waiting for enough 
requests to arrive to meet the required minimum, 


If sufficient transport drives are available such 
that waiting for a transport drive seldom occurs 
then the average system response time increases 
linearly with increasing topplate queue length. 

Mie ianedrity andicates that. jobs waited long «*”~- 
periods in the topplate queue waiting for enough 
requests to arrive. Thus in the case of low input 
rates, the advantage of faster search times by queu- 
ing requests is defeated by having to wait to fill 
the queue. 


Minimum topplate gueue lengths make significant 
Savings in system response time if many data set 
requests arrive ina burst or from high input rates. 


If the minimum topplate queue length is 5 or less 
then 3 transport drives would reduce system response 
time Significantly. For example, at 90 jobs per 

hour with a topplate queue length of one, the system 
response time is reduced from 6.5 minutes to 0.75 
minutes when the number of transport drives is in- 
creased from 2 to 3. If the minimum topplate queue 
length is greater than 5 then the reduction in system 
response time from the third transport drive is less 


‘than one minute. The fourth drive makes no difference 


in system response time. 
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15) The average search time to locate a data set is 
dependent on the data set arrangement criteria. 
Size and activity arrangement produce savings of 
5 to 15 percent when compared to uniform arrangement. 


16) Data set arrangement criteria of size or activity 
produce Significant savings of near 50 percent in 
average system response time. The amplification of 
Savings 1s the result of queuing more data set re- 
guests before searching which leads to more efficient 
searching. 


17) Arrangement by activity reduces average system re- 
; Sponse time more than size arrangement criteria. 


18) At 90 jobs per hour with 4 transport drives the 
average utilization of the 5 topplates, the single 
data channel and a Single disc controller were all 
about 70 percent. The transport utilization was 
20 to 35 percent. With three transport drives, the 
average utilization of the transport drives increases 
to near 100 percent causing the utilization of the 
other devices to decrease to about 65 percent. With 
2 transport drives the system was definitely trans- 
port drive bound with their utilization at 100 per- 
cent and the other device utilization decreaSing to 
40 to 60 percent. 


19) Thus the utilization of the TBM* hardware is inversely 
propertional to the non-availability of transport 
drives. Higher utilization of hardware is possible 


by dddingmextra tramsportsdrives to the system con- 
figuration, (a maximum limit of six). 


A. RECOMMENDATIONS 

The author makes no claims to have tested all the possible 
configurations of the TBM* Memory System. The recommendations 
Stated are based on the analysis derived from the model and 
the research performed. 

The request loads tested in this research are 60, 90 and 
120 jobs per hour resulting in data set requests of approxi- 


mately 300, 450, 600 per hour. The following are equipment 
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recommendations with exceptions included. The exceptions 


are possible cases which may occur at certain installations. 


1. Equipment Recommendations 


The following recommendations are made assuming the 


user requires normal operational reliability. If the users 


require very high operational reliability then at least two 


of each hardware device should be included in the system 


configuration. 


1) 


2) 


3) 


4) 


Only one supervisory control processor is re- 

guired. Unless the installation could make use 
of a redundant SCP (PDP-11) in some other appli- 
cations, then there is no need for a second SCP. 


Only one External Data Channel Processor is 
required. The host's throughput requirements, 
as derived by the hit ratio of data within TBM 
blocks and the user's data manipulation capa- 
bilities, determine whether throughput should be 
greater than 0.75 megabytes/sec. Only if the 
user requires higher throughput should an extra 
EDCP be used. 


Only one data channel is recommended. In read 
applications, the EDCP and the DC are busy at 

the same time, one connected to the other while 
the data flows through. Consequently only one 

DC would be necessary unless a lot of updating 

is to’be performed. If a large percentage of 
referenced files are to be updated in place or 
written on a write tape, then a data channel will 
be tied up not only with the write data transfer 


jot, wath the write verificatten. In the write 


case, the data channel is used more and may become 
a bottleneck. If having just one data channel 
does delay the system significantly, then another 
data channel should be added. 


Two Disc Controllers. The disc storage is used 
by the TBM system for file directory information 
and data transfer to the host. The host uses 

the disc storage not only for data transfer to 
the TBM System but also as storage for frequently 
referenced files and perhaps as a work storage 
area. The only path to the disc storage is 
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2) 


6) 


through the disc controller. Only one entity 
can have a controller at once so if only one 
disc controller was available and the host was 
in control, then the TBM system would have to 
wait. The dual disc controller system allows 
one entity to have a disc controller and one 
disc drive, while another entity can use the 
other disc controller and reference any other 
disc drive. This method results in very few 
delays and each entity has the benefit of access 
to disc storage. The amount of disc space re- 
quired is host dependent. The TBM memory system 
needs only a file directory on disc and suffi- 
cient room to store one TBM block (this, of 
course, would be a ridiculous waste of TBM 
capabilities). 


Transport Drives. The TBM* Memory System will 
be transport drive bound in normal applications. 
The number of transport drives to include in the 
TBM System depends upon the desired system re- 
sponse time. The model can adopt to any con- 
figuration and the user should try to estimate 
his needs and then simulate the proposed system 
with the model. In general a user will find: 


a) If he has a lot of topplates and an equal 
hit ratio, then the minimum topplate queue 
length should be very low and an extra trans- 
port drive may be necessary. 


b) If he has a certain amount of data spread out 
over thirty topplates, then he will need more 
transport drives than if he had the same 
amount of data over fifteen topplates. 


c) If many requests enter the system at the 
Same time, they should be batched at the 
topplate until all requests have been 
received, and then the search should begin. 


qd) If a continuous high input rate exists, the 
minimum topplate queue level should be set 
between 10 - 15. 


e) If a continuous low input rate exists, then 
the minimum topplate queue level should be 
set to ones 


Dual Transport Modules. The number of dual 
transport modules to include in the user con- 
Figuration is totally dependent upon the amount 
of data the user wishes to have on-line. One 
Dual Transport Module is required for every 
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two topplates. One possible application of the 
TBM system is to store off-line information in 

a condensed form. One TBM tape is equivalent 

to 500 conventional tapes and can be stored in 
an off-line library.in the space of a single 
Deaens= ring binder. ain thus application only 
one Dual Transport Module is necessary since 

the tapes are taken off-line after being filled. 
Another application is for interactive CP storage. 
In order to minimize access time, the data is 
stored on one-seventh of a tape. In this 
application many more dual transport modules are 
necessary than normally used for the same amount 
Of ntobmne t Lon. 


2. Summary 
The individual user must carefully define his needs 
before selecting a particular memory technology or memory 
system. The TBM* Memory System uses new mass storage tech- 
nology and is indeed rather complicated. The system model 
is presented to help the user understand the system. The 
sensitivity analysis 1S presented as a guide for the user in 


selecting the proper configuration for his installation. 
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APPENDIX A 


TBM* MEMORY SYSTEM HARDWARE 


A. COMMUNICATION AND CONTROL SECTION (CCS) 

The control section aoneses of the System Control 
Processor (SCP), which is a PDP-1l computer, External Data 
Channel Processor (EDCP), discs, and channel switching 
hardware. 

1. Supervisory Control Processor 

The SCP controls overall functioning of the TBM 
system. The SCP interprets requests for search and data 
transfer operations initiated by the host computer. The 
SCP then allocates Transport Modules, Transport Drivers, 
Data Channels and disc space necessary to complete the 
operation. A basic description of the PDP-1l by digital 
corporation is given in Figure 2l. 

2. External Data Channel Processor (EDCP) 

The External Data Channel Processor is usually a 
PDP-11 like the SCP. - The purpose of the EDCP is to act as 
an interface between the TBM* Memory System and the shared 
discs. The throughput of the TBM System is limited by the 
number of EDCP's. Reasons for having an EDCP instead of 
directly transferring data from the data channel to the disc 
are complex. A brief explanation of the reasons are as 
follows: 


a. The transfer rate of the TBM* Memory System is 
approximately 6 X 10° bits per second on either 
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Figure 21 


CHARACTERIZATIONS OF THE PDP-ll 


Manufacturers & Model 


DATA Formats 
Word length, bits 
Fixed-point operand length, bits 
Instruction length, bits 


MAIN STORAGE 
Storage type 
Cycle time, microseconds/word 
Minimum capacity, words 
Maximum capacity, words 
Parity checking 
Storage protection 


CENTRAL PROCESSOR 
No. of accumulators 
No. of index registers 
No. of directly addressable words 
Indirect addressing 
Add time, microseconds (full word) 
Hardware floating point 
Hardware byte manipulation 
Immediate (literal) instructions 
Power failure protection 
Real-time clock or timer 


INPUT/OUTPUT CONTROL 
I/0 word size, bits 
Direct memory access channel 
Maximum I/0 rate, words/sec 
No. of external interrupt levels 


PERIPHERAL EQUIPMENT 
Disc pack storage 
Non-interchangeable disc storage 
Drum storage 
Magnetic tape speed, cps 
Punched card input speed, cpm 
High-speed paper tape input, cps 
High-speed paper tape output, cps 
Other standard peripheral units 


lnatapro 40 Features and surveys, 


Tol AbO@E Mini Computers, " 
Datapro Research Corporation, 1972. 
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PDP-1l1 by niga 


16 
16 
16/32/48 


Core 

i. 2 
4,096 
124K 
Optional 
No 


8 

ip toss 
a2, 765 
One-level 
ZS 

No 
Standard 
Standard 
Standard 
Optional 


16 
Standard 
833,000 
Variable 


mes 

Yes 

No 

No 

200 

300 

300 

50 

DECtape, CRT displays, 
A/D converters, 
printers, etc. 





Manufacturers & Model 


SOFTWARE 
Assembler 
FORTRAN compiler 
Other compilers 
Operating system 


AVAILABILITY 
Date of first delivery 
Number installed to date 
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BBP Ly Sigil 


2-pass 
Yes 
BASIC 
aS 


March 1970 
Over 1000 





the read or write channel of any data channel. 
The transfer rate of the 3330 disc is approxi- 
mately 6.4 X 10© bits per second. Neither 
system can be slowed down or speeded up. The 
difference in transfer rates would be disastrous 
1f an interface buffer was not used. 


b. Originally the host put the data sets on the 
disc and entered the address in its disc file 
directory. Tnese data sets were then stored 
in one or more TBM blocks. The host then signals 
the TBM* Memory System that a particular data 
set is wanted and to put the data set in just a 
certain location that 1S now available on disc. 
The host is expecting that data set to be in 
that location and has notified its file directory 
in order to get to the data set. The data stored 
on TBM tape is addressed for the original loca- 
tion so the address must be changed. This address 
changing is done on the fly by the channel 
Simulator just before the data goes to the disc 
controller. 

The PDP-1l used as the EDCP must contain at least 3 
core buffers of 16 K bytes each in order to allow the TBM* 
memory system to send data continuously. The fourth 16 K 
byte buffer is an extra which, when used, allows the TBM* 
Memory System to send data continuously and the 3338 disc to 
accept data continuously or vise versa, (see Figure 22). 

Since one TBM block is approximately 130 K bytes 
and any number of blocks can be read as long as disc space 
1s available, then it is obvious that the data cannot stop 
in the EDCP. The 16 K byte buffers are used in a flip-flop 
Manner with the data channel filling several buffers before 
data starts transferring to disc. 

3. IBM 3330 Disc System 

According to Ampex, the best disc system to be 

connected to the TBM* Memory System is the IBM 3330 disc 


system. The 3330 disc system has a transfer rate close to 
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the TBM* transfer rate. The transfer rates are not the 
same, consequently the 3330 disc system is not optimal for 
the TBM* Memory System. The IBM 3330 disc drive with 
corresponding disc pack #3336 and disc controller #3830 is 
the most advanced of existing disc systems that satisfy the 


needs of the TBM* memory system, 


IDENTIFICATION 
Disc Pac # 3336 
Dasc Controller # 3830 
TECHNOLOGY 
Density - BPI 4040 
No. of Cylinders 404 


Removable Pack 
Movable Head 


Pa 

CAPACITY 
Per Drive (M bytes) 100 
Per Cylinder (K bytes) 247 
Min/controller 200 
Max/controller 800 

PERFORMANCE 
Minimum Access Time (M sec) ~ 10 
Maximum Access Time (M sec) 55 
Average Access Time (M sec) 30 
Average Rotational Delay (M sec) 8.4 
Rotational Speed (RPM) 3600 
Transfer Rate (K bytes/sec) 806 


One cylinder of a 3330 disc is capable of contain- 
ing two TBM blocks. Each 3330 disc has 400 cylinders for 
user data. A quick computation shows that 800 TBM blocks 
can be contained on one disc pack. The TBM* Memory System 
has to format all its data in the large TBM blocks (i.e., 
130 K bytes/block). The user would seldom need that much 


information at any one time. Consequently the methodology 


Cee 





assumes that uSer outputs data onto disc in small segments 
and keeps a disc file directory of where the data was placed. 
As far as the user knows, all of his data is assumed to be 
stored on disc. 

The purpose of the TBM* Memory System is to gather 
all the segments and form 130 K byte blocks. These blocks 
are then written on TBM tape and the TBM file directory 
knows all the data sets located within each TBM block. 
Whenever a uSer wants a data set stored on TBM tape, the 
host computer lets the SCP know what data set and where to 
put the entire TBM block on disc. When TBM nagiue the data ~ 
on disc, then the SCP Signals the host that the data is 
available and the user's data set, perhaps a small portion 
of the TBM block, is read from disc as though the data has 


always been on disc. 


B. DATA STORAGE SECTION (DSS) 
The data storage section consists of: 
A. 1 to 6 transport drivers. 


B. 1 to 32 transport modules, each consisting of two 
topplates. 


C. 1 to 3 data channels. 

Figure 23 shows an expanded data storage section. As 
illustrated in the figure, data never passes through the 
transport drivers, only through the data channel and to the 
transport module. The switching matrices allow any of the 
Six transport drivers to be connected to any of the 64 top- 


plates. Once this is done and the search has been completed, 
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Figure 23 EXPANDED DSS CONFIGURATION 
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the resulting combination of transport driver and top- 
plate can be connected to either the read or write channel 


ee amy Of the three data channels. 


1. Transport Driver Module 

The number of Transport Drivers in a system determines 
the number of tape transports which can be in operations 
simultaneously. Up to six Transport Drivers can be imple- 
mented in a Single system allowing for six simultaneous 
address searches (seeks) or read/write operations on as 
many different tape transports. 

The transport driver contains a mini-computer and 
the power, servo, and control electronics necessary to 
Seerate a tape topplate. As directed by the TBM System Con- 
eee Processor, the mini-computer controls the topplate to 
which it is connected during search and, in addition, a read 
erea write data Channel during data transfer. It also 
monitors transport and data channel status and reports I/0 
completion with status to the TBM System Control Processor. 

The ee operates in a real time process 
control mode. It operates as a Slave to the TBM System 
Control Process receiving high level operation commands. 

It executes commands by performing all the detail control 
and communication functions required by servos, address and 
tally track functions as well as read/write data channels. 
The mini-CPU is used extensively for system diagnostics, 


alignment and maintenance. 
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ie System Control Processor continuously monitors 
the status of the Transport Driver CPU. It also has the 
capability of reloading the mini'ts core memory as required 
without manual intervention. (See Figure 25). 

User data is handled exclusively via the TBM Data 
Channel Module. The Transport Driver involvement in data 
transfer is limited to channel control and error reporting 
to the System Control Processer. 

2. Dual Transport Module 

The number of dual transport modules determines the 
Overall storage capacity of a TBM system. A single system 
may have a minimum of one and a maximum of 32 dual trans- 


oil bits. 


port modules, each with a capacity of almost 1 

A dual transport module contains two tape topplates 
and the switching elements necessary to connect the tape 
transports to transport drivers. The switching elements 
are packaged in plug-in drawer units and can provide for 
both topplates to be simultaneously connected to any two out 
of a maximum of six transport drivers. 

The tape topplates (see Figure 26) have all the 
mechanical Bienenes necessary to move tape but only a bare 
minimum of electronic components. The electronics necessary 
to drive reel, capstan and video head motors, address and 
tally track ey electronics, general Concenol) logue, 
etc., are located in the Transport Driver Module while the 


electronics required to read and write digital data on tape 


is in the Data Channel Module. This approach minimizes the 
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cost of expanding the on-line data capacity of a system 
because of the Significant reduction in the cost of dual 


transport modules. 


Specifications 

Tape Speeds : oJ, os, 243, S80, 1000 ips 

Read/Write Data Rates : 6 X 106 bits per second 

Recording Media : Standard 2 inch magnetic video 
eape 

Tape Reel Capacity 2 one, 4 Oe bits 

Transport Module Capacity: 9.2 X 10 pases) (13 IBM 3330"s) 


Tape Data Format : Addressable Blocks (Std. 
block 10° pits) 
3. Data Channel Module 
Each Data Channel Module has a read and a write 
channel which can be operated simultaneously, each at a 
rate of 6 X 10° bits per second. A Single Data Channel 
Module therefore provides a throughput capability of 


12 x 10° 


bits per sow A maximum of three Data Channel 
Modules can be implemented in a TBM system, providing six 
read/write channels. Any combination of six channels can 
be operated simultaneously. The maximum Data Channel through- 
put 1s 36 X 1 bits per second, but EDCP's limit the system 
Porougniput to 12 X 10° bits per second. 

The ‘advantages of extra data ahenraeile are: 

1) System redundancy (backup) 

2) Maintenance downtime 

3) Write verification 

The Data Channel contains all the electronics neces- 
Sary to handle the transfer of user data. In the write mode, 


data first enters the data interface buffer section. From 


there data is routed through the error detection and 
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correction encoding logic, through FM encoding electronics 
and 1S recorded on tape. 

In the read mode, user data 1S reproduced from tape 
and restored by the FM read electronics, checked by the 
error detection and correction decoding logic, and then out- 
putted via the read channel data interface buffer section. 

The error detection and correction logic is imple- 
mented to correct any single bit error occurring within any 
985 bit block of user data while detecting occurrences of 
larger error bursts. See Appendix B. 

The Data Channel Module contains switching elements 
to simultaneously connect the read and the write channel to 
any two of a maximum of 64 tape transports. Figure 27 shows 
the Data Channel's functions as data is sent through the 


Data Channel. 
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APPENDIX B 


RELIABILITY 


A. DATA TRANSFER RELIABILITY 
1. Data Error Detection and Correction (EDC) 

Every bit is recorded twice - two adjacent video 
vertical tracks. They are separated by a vertical distance 
of approximately three-fourths inch. Errors due to defec- 
tive spots on new tapes or defects caused by usage (i.e., 
scratches) are reduced and nearly eliminated by the 
separation. 

Two of the eight heads on the rotating head drum 
are always in contact with the tape, (see Figure 28). This 
provides for the simultaneous recording and reading of the 
redundant data. The data channel takes the redundant data 
from each of the read heads and simply combines the signals. 
Dropout in either signal causes a drop in signal level rather 
Ehan a conplesew loss of signal. Thus no decision is required 
when an error occurs. The logic implemented in the channel 
will detect and correct any single bit error in a 985 bit 
block of user data. It will detect occurrences of larger 
error bursts. Re-reads are automatically initiated if error 
bursts are detected. 

The following is how a TBM block is broken down for 
error detection by the data channel as the data passes 
through. One TBM block consists of 182 segments or 1091 


code words. Therefore, each segment contains six code words 
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except for the first segment of each TBM block which 
contains only five segments. (See Figure 29). 


One TBM block = 182 segments 


1091 code words 

Therefore, each segment equals six code words (last 181 
segments of each block), except for the first segment which 
equals five code words. Each code word consists of 985 


bits of user information and 38 error detection check bits. 


+) =a 


Data 985 Bits EDC 
38 Bits 


CODEWORD 
There are two segments on each track before the first 
code of each segment which is a segment initialization. At 
data transfer rate read speed of 5 ips, the segment initiali- 
zation takes 32 m sec. " 
Example: i traek: 
2 segment/track: 


-6 code words/segment: 


Track A 


Initialization 


Segment l oe 
Segment 


Segment 2 
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TBM uses redundancy of recording in error detection 
and correction so each bit of information is recorded twice , 


simultaneously, by two different heads. 


Example: 


Two Tracks: Track 1000 & Track 1001 


segment A Seement 5 
22 Ceodevord 6 Of 
Seeine ite 5 
segment B segment C 


Codeword 6 of 
segment B 

The 38 EDC check bits are for detecting errors both 
Single and multiple errors within a code word. An example 
will best explain how the check bits work. 

Example of a single error detection and correction: 
A string of data is transferred. 

GO 11 T0a7 100001007101 0001/1001 160117 

Suppose one error occurred in transmission. Suppose 
a l instead of a 0 toe first bit transmitted. Logically stack 
each successive 8 bit string on top of the preceding one. 


~ 


10011101 
10000100 
POUT OO1L 
10110011 
Count the number of ones in each row and column: 
If odd: Denote with l 
If even: Denote with 0 


Gane., Rew 1 has five 1's. Therefore the cheekbit for Row 


eases 1.) 
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Check Bits for Rows 


LOOMED 1 1 
10000100 0 
1O11LOO1L af 
10110011 1 
00011001 - Check Bits for Columns 


Now by combining check bits for rows and columns, 
the result is a parity check bit word: 
1001 00011001 
Row Column 
Thais fs the restieeant cheek bit word that is com- 
pared with the initial check bit word that was derived when 
the data was stored. If the two check bit words match, then 
data 1s assumed to be correct. If the check bit words do 
not match then a trace is performed to find out where the 
error occurred. 
Example: First digit of the check bit words should 
have been a 0 instead of a 1. The data has one bit wrong, 


but the check word bit has two bits wrong. 


Correct Data Cheek Bit Data after Check 

for Rows TraTIsmiSSion Bits 
00011101 0 OOII101 u 
10000100 0 10000100 0 
10110011 Jk 10110011 1 
10110011 1 10110011 1 
10011001 Check bits for Columns 00011001 


001110011001 Check bit code words 101100011001 
Since the check bit words do not match, then a mask test 1s 


performed to find out what row and column do not agree. 
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Check Bit Words: 


Before Storage 001110011001 
After Storage HOUPEO'O'O 110.01 


Logically Compare (Absolute Difference): 
Disagreement 100010000000 


Row Column 
This indicates a Single error in Row 1, Column 1. Nowa 
comparison of check bit words is performed after switching 
data in Row 1, Column 1. If check bit word now agrees, 
then data is accepted. If check bit words do not compare, 
then some other error correction routine must be taken. 
Example of multiple error: Suppose the first two 


bmes Of the Same GCatamwere in error. 


Correct Data Check Bit : Data after Check 
for Rows Transmission Bits 
or O11101 1 10011101 1 
10000100 0 10000100 0 
HO11L0011 1 10110011 dL 
10110011 1 10110011 I. 
meorLogd Check Bits for Columns 00011001. 
101111011001 Check bit code words 101100011001 
Row Column Row Column 


The logical comparison shows a difference of two bits 
again but both differences occur in the column part of the 
check bit code word. This signals user that a multiple error 
has occurred. In some cases, correction of a multiple error 
is possible by examining the code words, but generally this 


1s not a good practice. 
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Example of another multiple error: Suppose the first 


bit of each byte was reversed. 


Data after Check 
Correct Data Check Bits Transmission Bits 
O011101 i email) 1 
10000100 0 00110011 0 
mO1PVOOLL 1 00110011 0 
10110011 dl 00110011 0 
01011001 Check Bits for Column LTO EOOL 
101111011001 Check code words 010011011001 
Row Column Row Column 


In this example there are four differences, all 
contained in the row part of the code word. 

Example of where this system does not work: Suppose 
the first two bits of the first and third bytes are reversed 


during transmission. 


Data after Check 
Correct Data Check Bits Transmission Bits 
1001101 1 10011101 1 
10000100 0 10000100 0 
10110011 I 01110011 ui 
POrLooLl 1. 1010011 ak 
11011001 Check bits for columns 11011001 
101111011001 Check bit code words Oo PETOrro0r 


The data is different but the code words are the 
same. This demonstrates the complexity that is involved in 
developing a good parity check. 

The TBM* Memory System is designed to detect and 
correct all single errors as well as detecting and initiating 
a re-read for all multiple errors. The goal was to achieve 


, 0 ; 
an uncorrectable error rate of no more than l in 2 X 10+ bites.. 
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2. Simultaneous Copy Generation 
The TBM* Memory System provides the capability of 


simultaneously writing data to multiple transports. This 
feature allows a user to create a backup tape or perhaps 
an image dump file while outputting his data to the active 
TBM file tapes. 

While files or records put into TBM are normally 
distributed between several on-line TBM tapes, the same data 
can be sequentially recorded on a tape dedicated as an image 
dump tape. Data written into TBM appears sequentially on 
the dump tape. This requires that one topplate be dedicated 
for output. If the output exceeds the capacity of a single 
dump tape, it will be necessary to take the dump tape and 
place it in an off-line library. If data cannot be recovered 
from an on-line tape, the appropriate dump tape is retrieved 
from the off-line library, mounted and the appropriate block(s) 
read. An index file is generated for purposes of cross- 
referencing. 

An example of this would be a large insurance company 
that has a large number of customers and several different 
daily updating routines. All data is stored on a conventional 
tape library system. The hit ratio will be small for each 
account, i.e., probably reference each account about twice a 
month for updating payments, finding accounts receivable, 
Statistical information collection, etc. Normal procedure 
is to make a tape of the day's transactions in numerical 
sequence by account number. A portion of the day's trans- 
actions are read in with the corresponding accounts. 
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After updating the account, the result 1s written over the 
original or written on another tape. This means an abun- 
dance of tapes, tape drives, operators to load tapes, and 
also an extensive filing system for on-line tapes and off- 
line tapes. If an error occurs, such as a lost tape or 
wrong computations, then past daily transaction tapes can 
be used to work backwards to get previous status or the old 
accounts status tape can be used if retained. 

The same insurance company could use TBM* Memory 
System and have daily old records saved either on-line or 
off-line. One TBM tape off-line is equivalent to 500 packed 
computer tapes at 800 B.P.I. and occupies the space of a 
three inch binder notebook. Daily transactions could be kept 
on one TBM tape, recently updated files on another tape, and 
past status of all changed accounts on a third tape. In 
addition, as the company grew, the TBM system could be ex- 
panded to keep pace with the company. This iS a very basic 
idea of a practical application without elaboration to 
demonstrate the efficiency of the TBM* Memory System and its 
ability to store several general generations of data for 
security. 

3. Internal Copy Generation 

Duplication of data is quite often employed as a 
means of insuring that the data is not lost. The TBM* Memory 
System can do tape duplicating independently of the host 
computer. This is especially convenient when the host is busy 


with internal matters or undergoing maintenance. The 
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compactness of the data allows the TBM* Memory System to 
store much more data in much less space, both on-line and 
off-line. 

Single blocks, logical files or whole tapes may be 
copied from a tape at any address to any other tape at the 
same or a different address. This procedure utilizes Yread/ 
Write loop connection hardware found in the TBM Data Channel 
Modules. This facility provides the user an ideal method 
for file packing, file backup (store off-line), and file 


Suganization optimization. 


B. HARDWARE LIFE AND RELIABILITY 

TBM read/write heads have an average life of 100 contact 
hours (range 80-120). Since there is no contact during search- 
ing operations, contact hours are accumulated only during 
read/write operations. A worn out head is refurbished at a 
cost of about $1500. Because of the high data rate, about 
.. billion bits of data can be transferred with the 100 
Contact hours. 

Any transport can be moved by any controller and data 
can flow through any channel and EDCP. Thus there is 
Mulipath access to data. If a hardware component goes down 
Or requires maintenance, TBM can re-configure to assure a 
continuous flow of data. Tapes can also be mounted on any 
drive. 

System diagnostics and maintenance are supplied to run 
in the off-line mode. These programs may be run independent- 


ly of the host CPU. In addition, a special programming 
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language is supplied for the off-line mode, which enables 
the operator to write his own test programs. The system 
diagnostic programs can isolate troubles down to the 
component level. Maintenance programs involve such jobs as 
positioning tape loops in vacuum chambers for best tape con- 
trol, pre-addressing and pre-checking new tapes, and testing 
the operation of servos and interval timers. Experience has 
indicated that a given block may be read and written about 


2000 times before significant tape wear occurs. 


95 


aap © 


» ee Cw 





APPENDIX C 
SYSTEM HARDWARE TIMES 
A. THE DISCONNECT/CONNECT TIMES OF TRANSPORT DRIVER AND 
TOPPLATE 
The SCP makes the decision to: 
1) Stop the read head; 
2) Disconnect the transport driver; 
3) Connect T.D. to another topplate; 
4) Lock the video head in place. 
The decision was made very quickly. The slow part is 
in waiting for the electromechianical mechanisms to make the 
necessary movements. Many situations and combinations of 
these situations can occur which complicates the timing. 
The most important consideration is whether the tape head 
is in motion. If it is, then the head must be stopped be- 
fore disconnecting. The disconnect/connect takes only 
0.525 sec. But if the tape head must first be stopped, then 
a total of 0.9 seconds is necessary. If the tape must be 
searched, then the video head phase lock takes place during 
the search. But if the tape 1S already positioned when a 
connection is made, then an additional time of 0.9 seconds 
is added to the disconnect/connect time. In quick summary: 
Disconnect/Connect Time: 
Stop Head - Disconnect - Connect - VH Lock 189 
Disconnect - Connect - VH Lock Jee 
Disconnect - Connect 0.5 
Stop - Disconnect - Connect 0.9 


The times seem short until one realizes that over 1000 


inches of tape could be searched in some situations. 
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6B. o8ARCH SPEEDS AND TIMES 

The SCP makes the decision which topplate is to be 
searched next. If a transport driver is not connected to 
the topplate, then 0.9 seconds, on the average, is lost in 
making the connection. The transport driver knows what 
address it wants. After the connection is made the trans- 
port driver knows where the tape head is presently located 
and which way it has to go. A search can be made both for- 
ward and backwards. See Figure 30. The speed to which the 
transport driver accelerates is totally dependent upon the 
distance which has to be covered. The following table gives 


the figures. 


Number of Blocks Velocity Time to Reach 
Greater than Accelerated to Velocity 
200 1000 IPS 0.7 or 1.2 sec 
46 248 IPS 360 msec 
8 83 IPS 120 msec 
0 5 IPS 15 msec 


If the search is being conducted at any speed above 5 IPS, 
then a gradual slowdown through the speeds takes place. When 
the head is within silent blocks of desired address, then the 
minimum speed, 5 IPS, is used. The tally track for the 
chosen block is read and the read or Gente pre-positioning 


takes place. 
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C. TIME TO READ ONE TBM BLOCK 

A TBM block is equivalent to 1,041,905 bits or approxi- 
mately 130 K bytes. One block uses 0.9752 inches of TBM 
tape. The data transfer read or write speed is 5 IPS. Time 
to read the entire block is 182 millaseconds (ms) or approxi- 
mately 0.2 seconds. The resultant data transfer rate is 
approximately 6 mega bits/sec. The interblock gap, (I.B.G. 
- the distance between blocks) is 5 x 10 inches. At 5 IPS, 
the time to go over the IBG is 2 ms. The beginning of each 


block has an initialization area that contains block 


information 
167 m sec 182 m sec 2 msec 
Ye 
5x1073 in Owo7 5S. ameh 
IBG Initialization TBM Block IBG 


A reread due to the data channel detecting a multiple 


error burst occurs only once out of every 3000 blocks read. 


p. TIME TO WRITE ONE TBM BLOCK 

The first consideration before writing is whether the 
tape has to be erased. Erasing is done at 5 or 83 ips 
dependent upon the number of blocks erased. 


Less than 10 blocks =. 5 ips 
Greater than 10 blocks 83 ips 


If the tape is clear then the time to erase is not 


necessary. This shows one of many advantages of uSing a 
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write dump tape instead of immediate update (assuming the 
write dump tape has been pre-erased). 

Advantages of using write dump tape: 

1. No need to erase during busy period. 

2. Update at a non-busy period. 

3. Security backup tape. 

4. Saves time (very important during busy period of day) 

Disadvantages of uSing write dump tape: 

1. Must keep a write file directory. 

2. Records are not current. 

The actual write is done at the data transfer read or 
write speed of 5 ips. Time to write is the same as the time 
to read, 182 millaseconds (ms). The resultant data transfer 
rate 1S approximately 6 megabits/sec. Each write is verified 
by re-reading just through the data channel's error detection 
section. This is also done at 5 ips. If no errors are 
detected then the data channel and disc space are released. 
Before the transport driver is released, the tally track is 
updated at 5 or 83 ips. The write is now finished and the 
transport driver is released. While the write has been 
transpiring, the SCP, in addition to controlling the write, 
has updated the write file directory if the system uses a 


write dump tape. 


l. Erase 5a0n 83 Ips 

2. Reposition Normal search speed 
3. Write* 5 lps 

4. Reposition Normal search speed 
5. Verify 5 ips 

6. Reposition Normal search speed 
7. Update tally 5 or 83 ips 


*Simultaneously the SCP updates the write file directory. 
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APPENDIX D 


TBM TAPE FORMAT AND CHARACTERISTICS 


User data is recorded on TBM tape in a block format. 
Each block contains one million (1,041,905) bits. The 
separation between blocks is 5 x 1073 inches. Individual 
blocks can be updated in place, i.e., erased and rewritten. 

There can be a maximum of 46,500 blocks per tape. Of 
these, a portion (less than a thousand) are dedicated to 
diagnostic and maintenance functions, while usually less 
than 0.5% are demarked as a result of permanent errors found 
during tape pre-testing. The demarcation indicator is lo- 
cated on the Tally track. A maximum of 45,000 blocks can 
be made available for user data. 

Three longitudinal tracks are recorded on ean tape, two 
of which are of interest to the user; the Address track and. 
tie Tally track, Figure 31. Associated with each block is 
a small segment of each track. 

The Address track field associated with one data block 
Meniiai ns both a unique tape number and a block address 
number. User tape block addresses are number sequentially 
from 1 through 45,000. Records or files may be accessed by 
physical addresses, i.e., tape number and block number. The 
user does not need to know on which transport a tape is 
mounted. The TBM System Control Processer maintains a table 


which relates tapes to transports. 
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The Tally track 1s a user oriented facility. Each 
Tally track field associated with a data block contains 
user security codes and search keys, date of last activity, 
read only interlocks, and more as shown in Figure 32. 

The Tally track is read prior to performing a block 
read, write, or erase, and updated after the process has 
been completed. At the user's option TBM will compare the 
security code words read from the Tally track to the code 
words supplied from the host CPU. Read, write, and erase 
processes can, at the user's option, be performed only if the 
Tally track code words match. 

System statistics can be generated by reading the Tally 
tracks only. The user may request that only the content of 
the Tally track be transferred to it for purposes of generat- 
ing system statistics. A typical usage of the Tally track 
information is to determine whether infrequently used files 
should be taken off-line and placed in a library or if those 
frequently used should be stored on faster devices such as 
discs or drums. 

The Tally track can be updated independently from the 
data block associated with it. This is a useful feature if 
search keys, security codes or read only interlocks are to 


be changed. Tally track key searches can also be performed. 


A. TAPE INFORMATION 
Bits are recorded serially on the track at 7800 bits/ 
linear inch of track. (Note: tracks run almost horizontal 


across TBM tape as shown in Figure 33). 
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Byte 


ah 


10 
11 
12 
13 
14 
15 
16 


Bits 


p-5 


Figure 32 TALLY FIELDS 


Function 
Last Operation Codes (Octal): 


Oke ror Maintenance 
Last OP Production Demarked Permanent Updatable 


Erase 25 26 - - 1 
Write 45 46 15 5 
Read 51 52 15 5 


22-bit User Identification Number (IDNUM) 


Month of last activity 


Day (of month) of last Activity 


Year of last activity (yeaxr-1970) 


Total Access Count (Read or Write Data) 


WierbreraAceess Count 


Number of Readings with Correctable Errors 


Number of Readings with Uncorrectable Errors 


Serial Number of Video Head Used in Write 


Alignment Number of Video Head Used in Write 
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Figure 32 (Continued) 


Write Retry Count 
Number of Write Channel Used in Write 


Number of Transport Used in Write 
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Figure 33 


TRACK DIMENSIONS 





7500 bits/lLimeéar anch of track 
189 tracks/inch of TBM tape 


This gives bit density: nr’ 

1.5 x 10° bits/square inch. 
All bits recorded redundantly. Therefore, effective data 
bit density 1S: 


iti 


bits/square inch. 
For every 985 bits of user data, there 1s another 38 error 
detection and correction bits. Therefore, the effective 


-user data bit density is: 


0.73 x 10° bits/square inch. 


B. STORAGE EQUIVALENTS 


1 TBM* Memory System Tape 500 Fully Packed 800 bpi 
Computer Tapes 


250 Fully Packed 1600 bpi 
Computer Tapes 


2500 Avg. Packed 800/1600 bpi 
Computer Tapes 


1 TBM* Memory System Tape 160 Full 2314 Disc Packs 
55 Publ 3330 Dise Packs 


1 TBM* Memory System Tape, Fully Packed $150 .0000003 


cents/bit 

1 3330 Disc Pack, Fully Packed $750 .00009 
cents/bit 

1 Computer Tape, Avg. Packed > 10° .00005 
cents/bit 


Being limited to 2000 reads or writes may at first appear 
to impose severe limitations on the system. Practically 


this is not usually the case since most files/data sets are 


LO”, 





accessed a few times per day over extended periods of 

time (very frequently accessed files should be on disc). 
For example, if a file is read once a day, that particular 
segment of tape would last approximately 2000 processing 
days or close to ten years. If the file, on the average, 
is updated once a day, the tape would last five years. 

The system keeps track of the number of times each TBM 
block has been accessed on the Tally track, and raises a 
block/file move flag when the access count reaches the limit. 
If the flag is raised, the data set can be moved to another 
section of the tape. A full TBM tape can be rewritten in 
two hours in an off-line mode. 

Searching over a section of tape does not affect the 
read/write count. Normal tape quality allows for in excess 
of 50,000 searches over any segment of tape. 

TBM tapes can be replaced for about $150, or if pretested, 


$250. 


C. TRACK FACTS 
Data Tracks: 
- Each transverse track comprises two (2) segments. 


- Each segment (except the first in a block) contains 
6 code-words: 


5910 (6x 985) data bits 
228 (6x 38) error check bits 


sage segment sync bits 
6330 igtal Hits 
=-JFaAchwoLt stored in agtracksoccupies 2932 O2000137. ) 


Ge track length. 
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~- Scanned at 792 in/s, this recorded wavelength yields 
a 6 mbit/s data stream. 


= Meackomere 3.5 mils (0.0035 in®) in width and are 
separated by 1.8 mil (0.0018 in.) guard bands. Track 
Sedemigers 5.3 mils (0.0053 in.) 


- Each data block contains 184 track spaces. 183 of these 
are used for data storage. The 184th is sacrificed to 
provide reasonable tolerances for erase start/stop. 

The loss of one complete track renders one-half of each 
of two other tracks unusable because of the loss of 
redundancy. Thus, the usable data space is the equiv- 
alent of 182 tracks. 


Comercol Track: 

- Each control track cycle occupies the same length of 
tape as eight data tracks, eight address track bits, 
er exgqht tally track bits: 

O24 ymtrs (0.0424 in.) 


- Each block space on tape yields 23 control track pulses 
and one "Frame Pulse." 


- The frequency and period of control track pulses vary 
with tape speed: 


Speed Frequency Period 
5 in/s eZ Zee 8 ms 
83 in/s 1953 Hz 0.5 ms 
248 in/s Sy 857 S) 82 io S 
1000 in/s ZS. On eZ 42.4 s 


Address Tracks: 
~ Each block space contains 184 address track bit spaces. 
-~ Two 16-bit words are read from each address blocks: 


16 bit block address, and 8 bit tape number + 8 bits 
of the block address 


~ Each address track bit occupies the same length of 
tape as one data track: 


Soemets (0n0053 in.) 


- The rate at which addresses may be read from the tape 
depends on the tape speed: (approx.) 
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Speed Address Interval 


5 in/s 190 ms 

83 in/s 12 ms 
248 in/s 4 ms 
1000 in/s 1 ms 


Pally Traek 


Each block space contains 184 tally track bit spaces. 


Bach block may contain up to 19 bytes of arbitrary 
information. The remaining four bytes are used for 
bit and block sync, block checksum, and inter-block 


gap. 


Each tally information byte includes seven (7) 
arbitrary information information bits and one parity 
[SiRe . . 


Each tally byte occupies the same tape length as one 
Gentrol Track cycle or eight Data Track. 


The tally track 1S written or read at either 5 in/s 
or 83 in/fs. The byte intervals for the two speeds 
are: (approx.) 


Seam /ss —ees ms 
83 inf/s - 0.5 ms 
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APPENDIX E 


eens eee, UPDATING TECHNIQUES 


A. FILE DIRECTORY RETRIEVAL 

The TBM* Memory System uses the index sequential method 
for data retrieval. The host sends a data set request to the 
TBM SCP (Supervisor Control Processer). The host then puts 
the user's job on an inactive status and continues with other 
processing. The SCP takes the data set request and searches 
its file directory. This search takes only 100 - 400 ms 
Since the file directory is kept on a 3330 disc. The file 
directory 1S multi-level with only the first level being a 
straight sequential search. If the first level is: 1) 
continuously updated, 2) kept continguously, and 3) not 
fragmented, then the first level search is fast. The first 
level gives a location on the disc for the next level of 
search. After the first level, there iS never a need to do 
a sequential search, since each succeeding level gives 
another address. When the data set description block is 
found in the file directory, the tape number, the location 
on the tape, and the disc space required is kept by the SCP. 
The disc space required is sent to the host and the search 
for the data set begins. 

The size of the file directory is moderate for a 3330 
disc. Even if a large data base of 10> files exists with 
each file address requiring 500 bytes in the file directory, 


(which should be more than enough), then a total of 5 x 10/7 


asi: 





bytes would be stored in the file directory, which is 


less than half of one 3330 disc pack. 


B. REBUCE HOST ACTIV EEY 

Using either single or dual interface buffers, TBM 
provides exceptionally rapid capability in the file update 
task. By knowing ahead of time which records are reguired 
by the host CPU, TBM can index these records, put the TBM 
block on disc, and let the host get just the record or 
records it needs. Updated records which will not fit back 
into their original space will be placed in overflow loca- 
tions with their positions noted. When TBM rewrites the 
disc buffer on tape, the proper sequence of the TBM blocks 
will be retained. This technique will dramatically reduce 
channel time, processor time and ultimately job time. For 
example, if the hit ratio is 1%, then the load on the host 
I/O channel is reduced by more than a factor of 100. 

Tllustrative Examples: 

Tape In - Tape Out Method: This is the conventional 
method of updating sequential files by creating a new file 
in each update process. Logical eeesiese are read onto disc 
where keys are examined. Records with modifications are 
transferred to the host CPU where processing is performed. 
Modified records and additions to the file are transferred 
from the disc through the EDCP in proper sequence and 
written by TBM. Records to be dropped from the file are 


Simply deleted in the disc. 
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Fixed length records are handled as illustrated in 
Figure 34. Records l, 2, 4, 5, and 7 are read on to disc. 
Only records to be modified are transferred from disc to 
the host CPU. When the host is finished processing, the 
modified records and the new records are written to disc. 
The inactive records never leave the disc. The records 
are written in sequence back to the TBM* Memory System. 

Variable length records that may change size are re- 
formatted on disc and are written out in sequence as 
illustrated in Figure 35. In this example, the updated 
records, #2 and #5, are expanded by the host CPU. The 
added portions of the updated records and the new records, 
Pe amemto) are written to disc bysthe hest CBU. The records 
are written in sequence on the TBM* Memory System. 

The same example as above is illustrated in Figure 36 


for the dual buffer case. 


Laks 





qOWId NI - dLvddn/aav pe eanbty 
GCYddWx x 


GaLvddi)s 


UAIING A/I WAL AIONIS 





Te ONG ie ace 


¥¥94xG’ xx ‘4xZ 


XZQNI MOOTE 


SNIWHO SNIWHO 


sqwoodd HLONAT daxIa 


Wat 






re Mee Sos ere § 





ear a ae 
XHGNI MOOTA | 


‘ 
ry 


4 





qOWId NI - gaLwdan/aawv G€ sanbhbty 


Gada x x 
YUdsAInd A/I WaL AIONIS Ga LVddn » 


¥¥9%xG 4 x4 lf‘ ¥Z 


(ZU) WV 
XHGNI WOOT SNIWHO 


SqdwOo0dd HLONAT PIAWIuwA 





eesvevd 


SNIWHO 


Wat 


BeGe 7G. i 


XACNI AOOT 





Bes 





see 4 99 "4G pac TO 


M 


SNIWHO 


Wat 


OmS pac’ Tt 


SUugggng Iwnd - aLvwdan/daaw 9€ Z9ANbSTy 


tee 
e+ Fei eee Fe 


XGONI AOOTE 


116 


SGUOOTY HLONEAT AIGVIYWA/GSaXIa 


aie 
ed 


XGHQNI MOOT 


SYUdaInd Twad - aLvddn/aav 


Codd» x 
daLvadns 
¥¥95 4G xx 04 %Z 
fad 
e ob em C 
LSOH 
o'? 





APPENDIX F 


wer APPLICATIONS FOR CP USERS 


Normally the quicker the response the better the system 
is. The trade off between response time and file storage 
system is usually dependent upon cost. In certain appli- 
cations, like terminal response in an interaction system, 
i.e., CP, a quick response time is a must. 

Certain configurations of the TBM* Memory System could 
meet the quick response criteria and yet maintain the files 
on TBM tape. The statistics of terminal usage may help to 
explain why this is possible. The cones data does not 
represent any one particular installation, but is an approxi- 
mation to a typical population of CP users. 

In a representative population of 1000 users, the number 
of active uSers each day varies between 50 and 75. These 
users pay for their CPU time and normally the amount of core 
memory used. Naturally they would want to sign on, run their 
program, and sign off as quickly as possible. After the 
penietia’ BunmeenG of a program, the user, wadienormally: 

1) Correct his program; 

2) Prepare new data; 

3) Prepare for a different program; 


4) Quit for the day. 
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Cenmclusions about CP users are as follows: 


1) Although the user population is large, the number 
of daily users is small. 


2) If a user signs on once during the day, he will 
probably sign on again. 


3) A typical user's CPU time per sign-on is from 1/2 
) minute to 5 minutes. 


4) The average file stored in the TBM* system is 
approximately 512000 bytes with a range from 128000 
By eee eo 2.5 megabytes 


5) The average number of data sets per job is two. 


6) The activity of daily users is heaviest during 
working hours as expected, (see Figure 37). 


7) The average number of megabytes required to store 
all of the personal files of each user is 1.2. 
See Figure 38 for the distribution. 

8) The maximum rate of initial sign-on is 20 per hour. 


The main stipulation 1s to meet the quick response 


requirement. The major time factors in the TBM* system are: 


1) the wait to get a transport driver; 

2) the search and read time. 
A proposed method, that was tested by the simulation model, 
is os qimlecate one or two transport drivers for CP filles 
only. The entire population of interactive files would be 
stored on TBM tape but spread out over as many tapes as 
necessary so as to allow CP files to be stored only on one- 
seventh of each tape. This has the effect of reducing search 
times to 3.5 seconds. See Figure 39 for a comparison of 


normal search and CP search. 
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The initial Sign-on by the user would have TBM* transfer 
all or a percentage of the user's files to disc and left on 
the disc for the remainder of the day. For initial sign-on, 
it might take up to one minute to load the user files. After 
initial sign-on, the files would be stored on disc and normal 
CP interface would take place. This saves disc space by not 
keeping all the inactive user's files on disc, yet allows 


relatively quick access by active CP users. 
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Figure 38 DISTRIBUTION OF PERSONAL CP USERS FILES 
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0 Avg. TBMTAPE Data Blocks 14600: 43800 


T(Search Avg.) = 16.5 seconds 


Figure 39 CP AND BATCH TAPE LAYOUT 
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APPENDIX G 


COSTS 


A. EQUIPMENT COSTS 

This section contains rough cost estimates that were 
obtained from Ampex Corporation. The TBM* Memory Soetem de 
normally examined for its suitability to the application and 
Situation. Very few mass storage systems are available on 
the market at the present time and each system has its 
advantages and disadvantages. If two systems are adaptable 
to the mass storage requirements of an installation, then 
competitive bidding usually is the method of winning the 
contract. Thus there is no open market price for the TBM 
system, only rough estimates based on very limited experience. 
The minimum TBM* Memory System has an initial installed cost 
of approximately $500,000. This is the minimum system which 
will hold approximately 0 bits of user data on-line. The 
modularity of the TBM system allows the installation to 
build upon this Sree in any manner to best suit their needs. 


The cost per additional device is estimated as: 


Transport Driver S00 O 
Dual Transport Module 75,000 
Data Channel 75,000 
Supervisory Control Processor 

(PDP-11 with Equipment) 75,000 

(PDP-11 without Equipment) 30,000 
External Data Channel 

Processer 50, OO 
TBM tape 

Not pre-tested L509 

Pre-tested 250 
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Ampex does not sell the 3330 discs or the channel 
Simulator which Ap eeuned to be in the installation's 
environment. 

The approximate cost, depending on maximum on-line 


storage, is given as follows: 


On-Line 

Storage Cost 

(Bits) Equipment (Installed) 
ioe TD, DTM, DC, SCP, EDCP $ 500,000 
10 TD, 11 DTM, DC, SCP, EDCP 1,250,000 
3 x 1ol* 6 TD, 36 DTM, 3 DC, 2 SCP, 


2 EDEP 3,250,000 


See the maximum system in Figure 2. 


B. MAINTENANCE COSTS 
The maintenance of the TBM* Memory System is normally 
done under contract with Ampex, although Ampex will train 
the host's maintenance people. An estimate of the maintenance 


cost is 1% of the initial installation costs per month. 


C.. MEDIA “COSTS | 

The cost of a TBM tape is less than $150.00. The capacity 
of a tape oe in excess of 46 billion bits resulting in a cost 
per bit of approximately 3 x 107? cents. Reusing the media 
a thousand times reduces the cost per bit by the same factor 
when comparing systems uSing non-erasable media, (see Figure 
40). 

The media cost (for tapes) for storing a trillion bits on 
TBM tape is less than 4000 dollars as compared to more than 


100,000 dollars using fully packed conventional computer tapes. 
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APPENDIX H 


Perol CUSTOMER RESULTS 


tie fest Cusmomer te install a version of the TBM* 


Memory System is connected with the D. O. D., but the name 


and location are classified. The system bought was a special 


purpose system. The hardware purchased included: 


‘1-2 


6. 


7. 


36 dual tape modules 

2 data channels 

4 transport drivers 

DEC PDP-9 Computer (Supervisory Control Processer): 


16K x 18 bit Core Memory 
Extended Arithmetic Element (EAE) 
Automatic Priority Interrupt (APT) 
Real Time Clock 
300 cps Paper Tape Reader 
50 cps Paper Tape Punch 
KSR 35 Console Teleprinter 
Multi-Station Teletype Control 
w/fone KSR 33 Teleprinter 
Four TU55 DECtape Transports, w/controller 


Extended Core Memory (ECM): 
32K x 32 bit External Core Memory 
Interface to PDP-9, program I/0, data channel I/0 
Interface to TBM* Memory System Data Channels 
(4 part) 
Special Interface to TCP's of TBM* Memory System 


I/0 Bus connects to DA1O (PDP-9 to PDP-10) 
interface to SCP 


Note that the SCP was not a PDP-1l and that ECM was used in X 


place of an EDCP. See Figure 41. 
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Figure 41 TBM* MEMORY SYSTEM AS CONFIGURED FOR THE FIRST 
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Maintenance 

The installation wanted its own people trained in system 
maintenance instead of contracting With Ampex to have main- 
tenance done. This required training twelve to fifteen 
qualified people in order to maintain a twenty-four hour 
Surveillance, seven days per week. The training period took 
approximately three months full time. 
Initial Testing 

The System was pre-tested at Ampex and re-tested at the 
installation. The test results were required to meet 
specifications. In 220 hours of testing within an eleven 
day span the following results were recorded as shown in 
the summary. 
Summary 

During these tests, the total number of bits transferred 
was 1.68 x nen All major TBM* Memory System functions, 
such as read, write, erase, search, update were tested during 
the period. The following is a summary of results. The 


oyeeem proved@to be very’ reliable. 





[TEM (CC. SPECIFICATION RESULTS 
Up Time: Transport 90% 98.5% 
HE amsport 

Controller 90% 97.0% 

Data Channel 903% 99.4% 

System 90% 98.33% 
Rereads per 1000 Pins. On37 
Demarked Blocks 13% O202 7 
Data Transfers per Block 1200 10 S200 tr 830 12 
Unrecoverable Errors Pine 2 oO Owain 1.4-<x a6 
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Level A Acceptance Test Summary 
Period: Day 1 - Day ll (220 hours of testing) 


Blocks Written: Multi-block Mode 100,620 
1l-block mode 92,524 
Total 193,144 
Blocks Read: Multi-block Mode 1,062,390 
l-block mode S63 550 
Total 1,426,129 
Re-Read Rate (#/1000) -Multierror Detected: 
Multi-block Mode (231) O22 
l-block Mode (302) 0.83 
Overall (533) 01.37 EL0C0 
(weighted 
; average) 
Unrecoverables - cannot get information 0 
Unrecoverable Procedures - (could not get 
by re-read. Had special process, 1.ée., 
clean tape 2 
Undetected Errors 0 
# Selective Updates 122355 
%$ Bad Erase/Writes (27/95234) 0.0282 
T5 & T13 Point-Point searches 15513 


Command Failure Rate: Multi-block Mode 
l-block Mode 
Overall (Weighted) 


Demarked Blocks on Tape: 


Pre-test 
Post-test 


29 


(2399731694 =97.57% 
(7782/621775)=1.25% 
(10181/653469)=1.56% 


(27/98170)=0. 028% 
(22/1109907)=0. 002% 
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